One-Class Classification by Combining Density and Class Probability Estimation
Authors
Abstract
One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using either density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the target data. In this paper we present a simple method for one-class classification that combines the application of a density estimator, used to form a reference distribution, with the induction of a standard model for class probability estimation. In our method, the reference distribution is used to generate artificial data that is employed to form a second, artificial class. In conjunction with the target class, this artificial class is the basis for a standard two-class learning problem. We explain how the density function of the reference distribution can be combined with the class probability estimates obtained in this way to form an adjusted estimate of the density function of the target class. Using UCI datasets, and data from a typist recognition problem, we show that the combined model, consisting of both a density estimator and a class probability estimator, can improve on using either component technique alone when used for one-class classification. We also compare the method to one-class classification using support vector machines.
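The combination described in the abstract can be illustrated with Bayes' rule. If P(T) is the prior of the target class in the two-class problem, P(T|X) is the class probability estimate, and P(X|A) is the density of the reference distribution, then solving Bayes' rule for the target density gives P(X|T) = [(1 - P(T)) P(T|X)] / [P(T) (1 - P(T|X))] * P(X|A). The following is a minimal sketch under stated assumptions, not the authors' implementation: the one-dimensional Gaussian reference density, the smoothed k-nearest-neighbour class probability estimator, and all function names are hypothetical choices made for illustration.

```python
import math
import random


def fit_gaussian(xs):
    """Fit the reference distribution: a 1-D Gaussian (an illustrative choice)."""
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return mean, math.sqrt(var)


def gaussian_pdf(x, mean, std):
    """Density of the reference distribution, P(X|A)."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def knn_target_probability(x, labelled, k=15):
    """Class probability estimate P(T|X) from the k nearest labelled points.

    Laplace smoothing keeps the estimate strictly between 0 and 1,
    so the odds computed later never divide by zero.
    """
    nearest = sorted(labelled, key=lambda pair: abs(pair[0] - x))[:k]
    n_target = sum(label for _, label in nearest)
    return (n_target + 1.0) / (k + 2.0)


def build_one_class_scorer(target, seed=0, k=15):
    """Return a function estimating the target density P(X|T)."""
    rng = random.Random(seed)
    mean, std = fit_gaussian(target)
    # Artificial class: one sample from the reference per target example.
    artificial = [rng.gauss(mean, std) for _ in target]
    labelled = [(x, 1) for x in target] + [(x, 0) for x in artificial]
    prior = len(target) / len(labelled)  # P(T); 0.5 with equal class sizes

    def score(x):
        p = knn_target_probability(x, labelled, k)
        odds = ((1.0 - prior) * p) / (prior * (1.0 - p))
        return odds * gaussian_pdf(x, mean, std)

    return score


# Toy target class: uniform on [-1, 1], which a Gaussian fits only roughly.
data_rng = random.Random(42)
target_sample = [data_rng.uniform(-1.0, 1.0) for _ in range(200)]
score = build_one_class_scorer(target_sample)
print(score(0.0), score(3.0))
```

A test point would be accepted as a target instance when its score exceeds a chosen threshold. In this sketch the class probability estimator corrects the reference density where the Gaussian fits the target data poorly, which is the role the abstract assigns to the second component.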
Similar resources
Classifier-Adjusted Density Estimation for Anomaly Detection and One-Class Classification
Density estimation methods are often regarded as unsuitable for anomaly detection in high-dimensional data due to the difficulty of estimating multivariate probability distributions. Instead, the scores from popular distance- and local-density-based methods, such as local outlier factor (LOF), are used as surrogates for probability densities. We question this infeasibility assumption and explore a...
Accounting for secondary variable for the classification of mineral resources using co-kriging technique; a Case study of Sarcheshmeh porphyry copper deposit
Because the classification of resource models substantially affects future mine planning, an accurate estimation method is needed to ensure that the estimation error is minimized. The world-class Cu-Mo Sarcheshmeh porphyry deposit (central Iran) was selected as the study area. The hypogene zone of the deposit was chosen as the space in which estim...
Analysis of a Fusion Method for Combining Marginal Classifiers
The use of multiple features by a classifier often leads to a reduced probability of error, but the design of an optimal Bayesian classifier for multiple features is dependent on the estimation of multidimensional joint probability density functions and therefore requires a design sample size that, in general, increases exponentially with the number of dimensions. The classification method desc...
Studying Effectiveness of Landsat ETM+ Satellite Images Classification Methods in Identification of desert pavements (Case study: South of Semnan)
Extended abstract. 1. Introduction. Identifying landforms has been studied by many researchers. All definitions of geomorphology emphasize the study and identification of landforms. Understanding landforms and how they are distributed is an essential requirement in applied geomorphology and other environmental sciences (Shayan et al., 2012). O...
Overriding the Experts: A Fusion Method for Combining Marginal Classifiers
The design of an optimal Bayesian classifier for multiple features is dependent on the estimation of multidimensional joint probability density functions and therefore requires a design sample size that increases exponentially with the number of dimensions. A method was developed that combines classification decisions from marginal density functions using an additional classifier. Unlike voting...